Designing optimal transport networks 
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We investigate the optimal design of networks for a general transport system. Our network is 
built from a regular two-dimensional [d — 2) square lattice to be improved by adding long-range 



connections (shortcuts) with probability P^j 



where rij is the Euclidean distance between 




sites i and j, and a is a variable exponent. We introduce a cost constraint on the total length of the 
additional links and find optimal transport in the system for a — d-\-l. Remarkably, this condition 
remains optimal, regardless of the strategy used for navigation, being based on local or global 
knowledge of the network structure, in sharp contrast with the results obtained for unconstrained 
navigation using global or local information, where the optimal conditions are a = and a — d^ 
respectively. The validity of our theoretical results is supported by data on the US airport network, 
for which a ~ 3.0 was recently found [Bianconi et ai, arXiv:0810.44 12 (2008)]. 

PACS numbers: 89.75.Hc,02.50.-r,05.40.Fb,89.75.Fb,05.60.-k 



The interplay between topology and dynamics in com- 
plex systems represents the focus of many studies in 
different fields of research with important scientific and 
technological applications. Due to their enormous po- 
tential to represent the intricate topology of numerous 
systems in nature, complex networks [J [H, @] have re- 
cently been used as substrates in combination with a 
plethora of dynamical models to describe the behavior 
of biological, social, chemical, physical and technological 
networks [1, [H, @]. Much attention has been dedicated 
to theproblem of navigation in complex network geome- 
tries 0, i, i, EO, EH, IB, 0, E 0. In most cases, 
the influence of the underlying network geography on the 
performance of the transport process is investigated as- 
suming that only local information is available for navi- 
gation [7j,y,|9|,ll0|,ll7|. 

For many navigation problems of interest in science 
and technology, global rather than local information is 
required, i.e., any source node s possesses the knowledge 
of the entire network topology. In this situation, the av- 
erage shortest path {£) from source to target becomes 
the relevant navigation variable to be optimized. For ex- 
ample, in a subway network, such as in Manhattan, the 
travel routes should be planned or changed in such a way 
as to minimize the travel time for a given limited recon- 
struction cost. This task is performed by considering the 
whole structure of the network in terms of its nodes and 
links, namely, by knowing the location of all subway sta- 
tions, their connections and the shortest path between 
any two stations. If we now consider an underlying net- 
work of streets and avenues over which one has to plan 
or improve an existing subway network, and if the aim is 
to minimize the average travel time between its stations, 
the search for an optimal strategy to add new connections 
in the network for a given budget should therefore play 
a key role. Here we show that the imposition of a cost 
constraint, which to the best of our knowledge has not 
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FIG. 1: (Color online) Connections of a single node i. Each 
node i has four short-range connections to its nearest neigh- 
bors (a, b, c, and d). A long-range directed connection may 
be placed to a random chosen node j. The node j is selected 
with probability proportional to rij~^, where r^j is the dis- 
tance measured as the number of connections separating the 
nodes in the underlying lattice. 



been considered for optimal navigation, represents a cru- 
cial ingredient in the design and development of efficient 
navigation networks. 

Consider the case of an existing subway network which 
needs improvements [13, 113] • The financial cost to build 
up a large number of new direct connections between 
distant stations (i.e., non- neighboring sites) can make 
it prohibitive, since only limited resources are normally 
available for this task. This problem can be modeled by 
the following system. In a 2-dimensional regular square 
lattice, with all N = L'^ sites present, each site i is con- 
nected with its four nearest neighbors. The sites repre- 




FIG. 2: (Color online) Shortest-path length, ^, from each node 
to the central node in the network for different values of a. 
In this case we impose a constraint in the length of the long- 
range connections. The sum of the length of these connections 
is limited, A = J^^ij — where N is the number of nodes 
in the underlying lattice. The network model is constructed 
from a square lattice with nodes, with L = 256. We can 
clearly observe that the best condition for shortest path length 
is obtained for a = 3. 



sent the stations and the bonds represent the routes of 
the subway (see Fig.[T]). In our model, pairs of sites ij are 
then randomly chosen to receive long-range connections 
with probability proportional to ^^j", where r^j is the Eu- 
clidean distance between sites i and j. Finally, the addi- 
tion of long-range connections to the system stops when 
their total length (cost), X^rij, reaches a given value A. 
Since a controls the average length of the long-range con- 
nections, we obtain that, for a fixed value of A, and small 
values of a, longer connections, but fewer in number can 
be added due to the imposed total length limit. We there- 
fore expect that an optimal navigation condition must be 
revealed as a trade-off between the length and the num- 
ber of connections added to the system. 

Optimal navigation with the presence of long-range 
links in a lattice network without constraints was stud- 
ied recently by Kleinberg [7]. Here we show for the first 
time that a rather different behavior can be observed for 
this problem when realistic constraints on total length 
are imposed on the process of adding long-range connec- 
tions. To better demonstrate the competition between 
total length and number of links, we generate a single net- 
work realization {L = 256) for a given value of a and com- 
pute the shortest-path length, ^, from each node in the 
network to its central node. This calculation is performed 
as follows. Once we choose the root node (e.g., the central 
one), we visit all its neighbors, including the neighboring 
nodes connected by long-range connections. These vis- 
ited nodes are classified as shell one nodes, meaning that 
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FIG. 3: (Color online) Average shortest path length {£) as a 
function of a. There is a constraint in the total length of the 
long-range connections, A = ^rij = L^, where L is the size 
of the underlying square lattice. We find that the optimal 
shortest-path is achieved for a = 3. With the restriction 
on total length, the number of long-range connections are 
not fixed (e.g.. with a = 0, large long-range connections 
become frequent, which reduces the total number of long- 
range connections.) To obtain these results, we simulated 
10, 000 realizations for L = 512, 3500 realizations for L = 
1024 and 2048, and 25 realizations for L = 4096. 



they are only one time step away from the root node [l9|]. 
After that, we visit all the neighbors of these nodes not 
visited before and classify them as shell two nodes. Fol- 
lowing this procedure for all network nodes, we obtain the 
£ values (time) for each node to be reached from the root 
node. Figure [2] shows the contour plots representation 
of the i values performed for four different values of the 
parameter a. For a = 1 and 2 the number of long-range 
connections is small. As a result, only a few little islands 
sparsely dispersed in these networks are really close to 
their central nodes (only a few short and/or long-range 
connections away) . For a = 3 the added long-range links 
are shorter, but more numerous, thus substantially de- 
creasing the shortest path over the whole network. For 
a = 4, due to the very short length size of the added 
connections, only a limited region surrounding the cen- 
tral node displays a reduced shortest path. Sites which 
are further away from the origin have significantly larger 
path length i to the origin. 

We extract more quantitative information about this 
navigation problem by performing extensive simulations 
for different values of a and many realizations of differ- 
ent system sizes. In each case, the average shortest path 
{£) is calculated over all realizations, considering all the 
shortest distances between each pair of nodes. We as- 
sume that the total length (cost) is proportional to the 
total length of the links in the underlying network, i.e., 
A = AL^, where A is a constant. That is, the budget to 
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FIG. 4: (Color online) In (a) we show the average shortest 
path length {€) as a function of the lattice size L for the square 
lattice. The constraint in the total length of the long-range 
connections is A = . The curve with a = 3 increases slower 
with L compared to any other value of a. In the inset, the 
plot of the successive slopes 5s obtained from logiQ(^) ver- 
sus log^^Q L reinforces the display of power- law behavior of 
{€) with L for a 7^ 3. The plot of the successive slopes 75 
obtained from \og-^Q{t) versus log^Q(log^Q L) shown in (b) in- 
dicates that {€) increases as a power of the logarithm of L for 
the optimal condition a = 3. 



improve the system is a fraction of the cost of the current 
network (without long-range connections) [20]. 

The results presented in Fig. [3] clearly indicate the pres- 
ence of a minimum {t) for different system sizes at the 
same value of the exponent a = 3, where optimal naviga- 
tion is achieved with (global) knowledge of the shortest 
paths and cost limitations. The way in which {€) scales 
with system size L, however, seems to follow rather dif- 
ferent behaviors, depending on the value of a. We tested 
two possible forms for {€) vs TV, a power-law and a power 
of log N . As shown in the main plot of Fig. [4^, our re- 
sults for Of 7^ 3 suggest that the shortest path {tj follows a 
power-law with system size L. This is supported by the 
plot of successive slopes 5s obtained from log;Lo(^) 
sus L, which are almost invariant, as shown in the 
inset of Fig. 2^. In contrast, in the case of a = 3, the 
increase with L of {tj appears to be less rapid than a 
power-law. Interestingly, the successive slopes 75 ob- 
tained from logio(^) versus \ogiQ{\ogiQ L) ^ as presented 
in Fig. [3)3, indicate that {tj increases as a power of the 
logarithm of L, {€) ^ logio rather than a power of 
L, only for a = 3. This provides clear support for the 
fact that in the optimal condition, a = 3, the transport 
will improve even further as L increases, as suggested by 
Fig. El 

We also studied our model for a one-dimensional lattice 
and observed similar behavior. The optimal condition we 
obtained in this case at a = 2 (data not shown) leads 
us to conjecture that the optimal value is obtained at 



a = (i + 1, where d is the dimension of the underling 
lattice. Note that the Kleinberg result was extended to 
fractals Q, where the optimal exponent is found to be 
a = df, namely, the fractal dimension of the substrate. 
The {£) dependence on L for different a for the Kleinberg 
model was recently derived analytically Q. 

In the following, we present analytical arguments 
showing that a = 3 is indeed the only case where log- 
arithmic scaling of {£) with L can occur, while for a 7^ 3 
a power-law with L should exist. By arbitrarily fixing the 
cost parameter to A = AL^, we obtain that p ~ (^)~^, 
where p is the density and (r) is the average length of the 
added long-range connections. Since (r) ~ r'^~^dr^ it 
follows that for 2 < a < 3, p ~ and for a < 2, (r) 

is limited by the network size leading to p ~ Thus, 
for all values of a < 3 the density of the long-range links 
added, due to the constraint, decreases as a power-law 
with L. As a consequence of this power-law decrease in 
density, {£) must increase as a power of L. To see this 
we argue that {£) is bounded by the relation {£) > p~^^^. 
The right hand side, appears for the case of the 

small world model, where a = 0, with a fixed concen- 
tration of links, {£) ~ p~^/^lnL [3]. Since for the case 
< a < 3, (£) decreases with increasing a, the bound 
{£) > is rigorous and {£) in this range must 

scale as a power of L. For a > 3 and sufficiently large 
networks, (r) is finite and the density becomes indepen- 
dent of the system size, i.e., p ^ L^. Thus, the effect of 
the constraint A on navigation should become negligible. 
However, the finite value of (r) suggests that long-range 
links can be neglected and therefore {£) should scale as a 
power of L. Thus, it follows that only for a = 3, {£) can 
scale logarithmically with L, as suggested by our numer- 
ical simulations (see Fig. [3j). 

It is important to note that our global navigation 
scheme with {£) can be considered as a lower bound to 
any other transport navigation process. For example, 
a strategy based on purely local knowledge of the net- 
work structure will necessarily perform worse than any 
other with global information. In Ref. [7], for example, 
the greedy algorithm is introduced as a paradigm based 
on local information, where the traveler, when leaving 
a node, chooses to move to the one among its neighbors 
which has the smallest Manhattan distance to the target. 
Kleinberg found that a = 2 is the optimal value in the 
navigation with the greedy algorithm [7j. We next ask, 
what would be the optimal a for the greedy algorithm 
when cost restriction A = AL^ is imposed? We find also 
for the greedy algorithm that the optimal value is a = 3. 
This is shown in Fig. [5l where we plot the average de- 
livery length {£g) that a message travels with only local 
information of the system geometry. The message is sent 
from the source node s to the target node t through a 
network generated with the constraint A = L^. Remark- 
ably, the presence of a minimum also at a ~ 3 shows 
that the type of information (local or global) used by the 
message holder to pass it through the system during the 
navigation process becomes unimportant if the network 
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5: (Color online) The characteristic average delivery 
{ig)/L as a function of a for navigation with the greedy 



algorithm. The cost A involved to add long-range connections 
changes the behavior of the density of long-range connections. 
As a result of that, a minimum is observed at a ~ 3. Each 
data point is a result of 4000 simulations and the cost A is 
fixed at . 



is constructed under length (cost) limitations. However, 
the two mechanisms display very different and distinct 
behaviors regarding the scaling with system size. While 
we observe logarithmic growth for the optimal condition 
a = 3, in the case of global information, the time to 
reach the source, with the greedy algorithm and with 
cost constraint, appears to increase linearly with size for 
all values of a. The linearity of {£g) with L is observed 
in the scaling collapse (Fig. [5]) of the curves of {ig)/L vs. 
a. 

In summary, we have investigated the effect of intro- 
ducing a cost constraint in the optimal design of a trans- 
portation network. Our results show that, regardless of 
the strategy used by the traveler, based on local or global 



knowledge of the network structure, the best transporta- 
tion condition is obtained with an exponent a = (i + 1, 
where d is the topological dimension of the underlying 
lattice. The results recently reported by Bianconi et 
al. [21] on the US airport network provide striking sup- 
port for the validity of our optimal exponent a = 3. The 
fact that the probability of a flight connection within 
US decays as a power-law with the distance between air- 
ports, r~", where a = 3.0 ± 0.2, reveals the optimized 
aspect of the network under the conditions of uniform 
geographical availability (for customer satisfaction) and 
cost limitations (for airline companies profit). The re- 
sult a = 3 is in sharp contrast with the results obtained 
for unconstrained systems with global and local informa- 
tion, where the optimal conditions are a = fll'] and 
a = d [7, 8], respectively. The contrast between the op- 
timal results is even more dramatic. While in the uncon- 
strained case the mean length of a link diverges^ we find 
that when cost is considered the mean length is finite. In 
the case where the traveler has global knowledge of the 
network, and is able to identify the shortest path for nav- 
igation, we obtain a slow (logarithmic) growth with size 
for the transit time at the optimal condition. A different 
picture is obtained if the traveler has only local knowl- 
edge of the network. For example, in the case where the 
transportation path is decided based on the Manhattan 
distance to the target, we obtain a linear growth of the 
transit time with system size, for all values of the ex- 
ponent a. Finally, our results suggest that the idea of 
introducing a cost constraint in the navigation problem 
offers a different and more realistic theoretical framework 
to understand the evolving topologies of other important 
complex network structures in nature, such as subways, 
trains, or the Internet. 
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